To appear in: 

Rendiconti dell'Istituto Lombardo Accademia di Scienze e Lettere, Classe di Scienze. 



ON A THEOREM OF LYAPOUNOV 

ANTONIO GIORGILLI 

Dipartimento di Matematica, Via Saldini 50, 
20133 — Milano, Italy. 



Sunto (Summary). It is shown that a Hamiltonian system in the neighbourhood of an equilibrium
point, provided the eigenvalues satisfy non-resonance conditions of Melnikov type, admits a
normal form that makes evident the existence of a (local) two-dimensional invariant manifold
on which the solutions are known. In the case of a purely imaginary eigenvalue these solutions
form a two-parameter periodic family that is the natural continuation of a normal mode. This
second result was previously proved by Lyapounov. The present work completes Lyapounov's by
proving the convergence of the transformation of the Hamiltonian to normal form and by removing
the restriction that the eigenvalues be purely imaginary.



Abstract. It is shown that a Hamiltonian system in the neighbourhood of an equilibrium may 
be given a special normal form in case the eigenvalues of the linearized system satisfy non- 
resonance conditions of Melnikov's type. The normal form possesses a two dimensional (local) 
invariant manifold on which the solutions are known. If the eigenvalue is pure imaginary then 
these solutions are the natural continuation of a normal mode of the linear system. The latter 
result was first proved by Lyapounov. The present paper completes Lyapounov's result in that 
the convergence of the transformation of the Hamiltonian to a normal form is proven and the 
condition that the eigenvalues be pure imaginary is removed. 



1. Introduction 



Consider a canonical system of differential equations in a neighbourhood of an equilib- 
rium, with Hamiltonian 



(1)   $H(x,y) = H_0(x,y) + H_1(x,y) + \dots\,, \qquad (x,y) \in \mathbb{C}^{2n}$ ,

where the unperturbed quadratic part of the Hamiltonian is

(2)   $H_0(x,y) = \sum_{j=1}^{n} \lambda_j x_j y_j\,, \qquad (\lambda_1,\dots,\lambda_n) \in \mathbb{C}^n$ ,

and $H_s(x,y)$ for $s \ge 1$ is a homogeneous polynomial of degree $s+2$. The form (2) is a
typical one for the quadratic part of a Hamiltonian system in the neighbourhood of an 
equilibrium, as under quite general conditions the system may be given that form via a 
(complex) linear canonical transformation (see, e.g., [10] or [12], §15). 

The Hamiltonian is assumed to be analytic in some neighbourhood of the origin of
$\mathbb{C}^{2n}$. Moreover $\lambda_1$ will be assumed to satisfy at least the first of the
following non-resonance conditions:

(i) First Melnikov's condition:

(3)   $\lambda_\nu - k\lambda_1 \neq 0$ for $k \in \mathbb{Z}$ and $\nu = 1,\dots,n$ .

(ii) Second Melnikov's condition:

(4)   $\lambda_\nu \pm \lambda_{\nu'} - k\lambda_1 \neq 0$ for $k \in \mathbb{Z}$ and $\nu,\nu' = 1,\dots,n$ ,

the case $\nu' = \nu$ being included.
In [10] Lyapounov proved that if $\lambda_1 = i\omega_1$ is pure imaginary and the non-resonance
condition (i) above is satisfied then there exists a two-parameter family of solutions of
the form

(5)   $x_j = \varphi_j(\xi_1,\eta_1)\,, \quad y_j = \psi_j(\xi_1,\eta_1)$
written as convergent power series in the arguments

(6)   $\xi_1 = \xi_1^0\,e^{it\Omega(\zeta)}\,, \quad \eta_1 = \eta_1^0\,e^{-it\Omega(\zeta)}$ ,

where $\Omega(\zeta) = \omega_1 + \dots$ is a convergent power series in $\zeta = \xi_1\eta_1$. In the
case $n = 1$ this actually describes all solutions of the system. A proof in case all
$\lambda$'s are pure imaginary is reported in [12].

The proof of the theorem is worked out by the authors quoted above by expanding
the solution in the form (5) and proceeding by comparison of coefficients. From a formal
viewpoint the statement above looks equivalent to the existence of a canonical
transformation that gives the system (1) a suitable normal form, making the Hamiltonian
depend at least quadratically on $x_2,\dots,x_n,y_2,\dots,y_n$. A formal construction giving
such a normal form can be easily produced. However, proving the convergence of the
normalization procedure seems to be more difficult. The aim of this paper is precisely
to produce a proof of convergence of the transformation to normal form.

I will actually give two different statements that can be proved with the same 
method. The first one is 

Theorem 1: With the non-resonance hypothesis (i) above (first Melnikov's condition)
on $\lambda_1,\dots,\lambda_n$, there exists a canonical, near-the-identity transformation in the
form of a power series convergent in a neighbourhood of the origin, which gives the
Hamiltonian (1) the normal form

(7)   $H(x,y) = H_0(x,y) + T(x_1y_1) + F(x,y)$ ,

where $H_0(x,y)$ is as in (1), $T(x_1y_1)$ depends only on the product $x_1y_1$, and $F(x,y)$ is at
least quadratic in $x_2,\dots,x_n,y_2,\dots,y_n$.

The existence of the Lyapounov orbits for $\lambda_1$ pure imaginary is evident from the
normal form: just put initially $x_2 = \dots = x_n = y_2 = \dots = y_n = 0$, which defines a
local invariant two-dimensional manifold on which the dynamics is generated by the
Hamiltonian $\lambda_1x_1y_1 + T(x_1y_1)$. The advantage of the normal form is that it also allows
one to investigate the dynamics in the neighbourhood of the orbits so found. To this end
the following statement may be even more useful.

Theorem 2: With the non-resonance hypotheses (i) and (ii) above (first and second
Melnikov's conditions) on $\lambda_1,\dots,\lambda_n$, there exists a canonical, near-the-identity
transformation in the form of a power series convergent in a neighbourhood of the origin,
which gives the Hamiltonian (1) the normal form

(8)   $H(x,y) = H_0(x,y) + T(x_1y_1,\dots,x_ny_n) + F(x,y)$ ,

where $H_0(x,y)$ is as in (1), $T(x,y)$ contains only monomials of the form $x_1^jy_1^jx_\nu y_\nu$
with a positive integer $j$ and with $\nu = 2,\dots,n$, and $F(x,y)$ is at least cubic in
$x_2,\dots,x_n,y_2,\dots,y_n$.

This requires a stronger non-resonance condition. However this normal form may be
more convenient if one is interested in the stability of a Lyapounov orbit. Indeed, let all
$\lambda$ be pure imaginary, say $\lambda_j = i\omega_j$, and write the Hamiltonian restricted to the
invariant manifold $x_2 = \dots = x_n = y_2 = \dots = y_n = 0$ in action-angle variables by
transforming $x_1 = \sqrt{p}\,e^{iq}$, $y_1 = -i\sqrt{p}\,e^{-iq}$. Thus one gets the Hamiltonian
$\omega_1p + T(p)$, which represents a non-linear oscillator, with orbits written as
$p(t) = p^*$, $q(t) = q(0) + \Omega(p^*)t$, where $\Omega(p^*) = \omega_1 + O(p^*)$ is a fixed frequency.
By a translation $p' = p - p^*$ the Hamiltonian may be reexpanded (omitting primes) as

$H(q,p,x,y) = \Omega p + \sum_{j=2}^{n}\lambda_jx_jy_j + H_1 + H_2 + \dots$

where $H_s$ is a homogeneous polynomial of degree $s+2$ in $p^{1/2},x_2,\dots,y_n$ with coefficients
periodically depending on $q$. The dynamics of the latter Hamiltonian may be investigated
with known methods from perturbation theory. The advantage with respect to the
normal form of theorem 1 is that the quadratic part of the Hamiltonian is independent
of the angle $q$.

The proof is based on a previous work by the author [8] concerning the construction
of the normal form in a case investigated by Cherry [2] and Moser [11]. It must be
stressed that this problem does not involve small divisors. Rather, the possible source
of divergence is due to the use of Cauchy's estimates for the derivatives required by the
normalization algorithm. The global effect of accumulation of derivatives is controlled
with a technique introduced by the author and U. Locatelli in order to achieve a proof of
the KAM theorem using classical expansions in a perturbation parameter (see [4], [5], [6], [7]).

2. Formal algorithm 

Reducing the Hamiltonian to a normal form is a quite general problem which may be 
solved in a number of different ways. Moreover, the concept of "normal form" may 
assume a quite general meaning, depending on what one is looking for. Here I state the 
algorithm in a general form, using the method of composition of Lie series. 

2.1 The algorithm for the normal form 
Write the Hamiltonian after $r$ normalization steps as

(9)   $H^{(r)}(x,y) = H_0(x,y) + Z_1(x,y) + \dots + Z_r(x,y) + \sum_{s>r} H_s^{(r)}(x,y)$ ,

where $Z_1(x,y),\dots,Z_r(x,y)$ are in normal form, whatever that means, and are
homogeneous polynomials of degree $3,\dots,r+2$. For $r = 0$ the Hamiltonian (1) is considered
to be already in the wanted form, with no functions $Z$.

Assume that the Hamiltonian has been given a normal form (9) up to order $r-1$,
so that $H^{(r-1)}$ is known. The generating function $\chi_r$ and the normal form $Z_r$ are
determined by solving the equation

(10)   $L_{H_0}\chi_r + Z_r = H_r^{(r-1)}$ ,

where the common notation $L_\varphi\,\cdot := \{\cdot,\varphi\}$ has been used. The solution of this
equation depends on what is meant by "normal form". At a formal level, any form that allows
one to solve the equation above for $Z_r$ and $\chi_r$ is acceptable. Assume for a moment that a
method of solution has been found. Then the transformed Hamiltonian is expanded as

(11)   $H^{(r)}_{sr+m} = \frac{1}{s!}L_{\chi_r}^{s}Z_m + \sum_{p=0}^{s-1}\frac{1}{p!}L_{\chi_r}^{p}H^{(r-1)}_{(s-p)r+m}$   for $r \ge 2$, $s \ge 1$ and $1 \le m < r$ ,

       $H^{(r)}_{sr} = \frac{1}{(s-1)!}L_{\chi_r}^{s-1}\Bigl(\frac{1}{s}Z_r + \frac{s-1}{s}H_r^{(r-1)}\Bigr) + \sum_{p=0}^{s-2}\frac{1}{p!}L_{\chi_r}^{p}H^{(r-1)}_{(s-p)r}$   for $r \ge 1$ and $s \ge 2$ .

The justification of the algorithm requires only some straightforward calculation, and
is deferred to appendix A.

Thus, the problem is how to solve the equation (10) for the generating function and
the normal form. Let me make some general considerations.

Let $\mathcal{P}_s$ denote the linear space of homogeneous polynomials of degree $s$ in the
complex variables $x,y$. Let also $\mathcal{P} = \bigcup_{s\ge0}\mathcal{P}_s$, so that a formal power series is an
element of $\mathcal{P}$. A basis in $\mathcal{P}$ is given by the monomials
$x^jy^k := x_1^{j_1}\cdot\ldots\cdot x_n^{j_n}\,y_1^{k_1}\cdot\ldots\cdot y_n^{k_n}$,
where $j,k$ are integer vectors with non-negative components. The linear operator $L_{H_0}$
maps every space $\mathcal{P}_s$ into itself. If, due to the choice of the coordinates, the unperturbed
Hamiltonian $H_0$ has the form (2) then the operator $L_{H_0}$ is diagonal, since

$L_{H_0}\,x^jy^k = \langle j-k,\lambda\rangle\,x^jy^k$ .

The kernel and the range of $L_{H_0}$ are defined as usual, namely $\mathcal{N} = L_{H_0}^{-1}(0)$, the inverse
image of the null vector in $\mathcal{P}$, and $\mathcal{R} = L_{H_0}(\mathcal{P})$. Both $\mathcal{N}$ and $\mathcal{R}$ are actually subspaces
of the same space $\mathcal{P}$, and it turns out that they are complementary subspaces, i.e.,
$\mathcal{N}\cap\mathcal{R} = \{0\}$, the null vector, and $\mathcal{N}\oplus\mathcal{R} = \mathcal{P}$. A consequence of the properties above is
that $L_{H_0}$ restricted to the subspace $\mathcal{R}$ is uniquely inverted, i.e., the equation $L_{H_0}\chi = \psi$
with $\psi \in \mathcal{R}$ admits a unique solution $\chi$ satisfying the condition $\chi \in \mathcal{R}$. That unique
solution will be written as $\chi = L_{H_0}^{-1}\psi$, i.e., $L_{H_0}^{-1}$ is defined as the inverse of $L_{H_0}$
restricted to $\mathcal{R}$. It is easy to identify the subspaces $\mathcal{N}$ and $\mathcal{R}$ using the coordinates.
Thanks to the diagonal form of $L_{H_0}$ one has

$\mathcal{N} = \mathrm{span}\{x^jy^k : \langle j-k,\lambda\rangle = 0\}$ ,
$\mathcal{R} = \mathrm{span}\{x^jy^k : \langle j-k,\lambda\rangle \neq 0\}$ .

Given $\psi \in \mathcal{R}$ and writing $\psi = \sum_{j,k}\psi_{j,k}x^jy^k$, with $\psi_{j,k} = 0$ for $x^jy^k \in \mathcal{N}$, one has

(13)   $L_{H_0}^{-1}\psi = \sum_{j,k}\frac{\psi_{j,k}}{\langle j-k,\lambda\rangle}\,x^jy^k$ .

In view of the general considerations above we can conclude that the choice of a
normal form is subject to the constraint that in equation (10) we have
$H_r^{(r-1)} - Z_r \in \mathcal{R}$. The simplest choice is to ask also $Z_r \in \mathcal{N}$, i.e., to set $Z_r$ to be
the projection of $H_r^{(r-1)}$ on the subspace $\mathcal{N}$. This is known indeed as Birkhoff's normal form.
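The Birkhoff choice just described can be sketched in a few lines. The following is only an illustration, under the assumption that a homogeneous polynomial is stored as a dictionary mapping exponent pairs $(j,k)$ to coefficients; the function name `solve_homological` and the tolerance parameter `tol` are hypothetical and not part of the paper.

```python
def inner(v, lam):
    """Bilinear pairing <v, lambda> = sum_i v_i * lambda_i."""
    return sum(vi * li for vi, li in zip(v, lam))


def solve_homological(psi, lam, tol=0.0):
    """Solve L_{H0} chi + Z = psi with the Birkhoff choice Z in the kernel.

    psi maps (j, k) -> coefficient of x^j y^k, with j and k tuples of
    non-negative integers. Monomials with <j - k, lambda> = 0 (up to tol)
    are copied into Z; the others are divided by the divisor, which is
    the diagonal action of L_{H0} on x^j y^k.
    """
    chi, Z = {}, {}
    for (j, k), coeff in psi.items():
        divisor = inner([a - b for a, b in zip(j, k)], lam)
        if abs(divisor) <= tol:
            Z[(j, k)] = coeff            # resonant term: kept in the normal form
        else:
            chi[(j, k)] = coeff / divisor  # non-resonant term: removed by chi
    return chi, Z


# Illustrative data (assumed values): n = 2, lambda = (i, 2.5 i).
lam = (1j, 2.5j)
psi = {((2, 0), (1, 0)): 3.0,   # x_1^2 y_1, divisor lambda_1
       ((1, 1), (1, 1)): 2.0}   # x_1 y_1 x_2 y_2, divisor 0
chi, Z = solve_homological(psi, lam)
```

With floating-point eigenvalues an exact zero test is fragile, so in practice a small positive `tol` may be preferable.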

2.2 Normal form for Lyapounov's orbits 

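I come now to show that the construction of the normal form of theorem 1 is formally
consistent. Consider the disjoint subsets of $\mathbb{Z}^n$

       $\mathcal{K}^\sharp = \{k \in \mathbb{Z}^n : k_2 = \dots = k_n = 0\}$ ,
(14)   $\mathcal{K}^\natural = \{k \in \mathbb{Z}^n : |k_2| + \dots + |k_n| = 1\}$ ,
       $\mathcal{K}^\flat = \{k \in \mathbb{Z}^n : |k_2| + \dots + |k_n| > 1\}$ .

One has $\mathbb{Z}^n = \mathcal{K}^\sharp \cup \mathcal{K}^\natural \cup \mathcal{K}^\flat$, of course. Considering only integer vectors $j,k$ with
non-negative components, introduce the subspaces of $\mathcal{P}$

       $\mathcal{P}^\sharp = \mathrm{span}\{x^jy^k : j+k \in \mathcal{K}^\sharp\}$ ,
(15)   $\mathcal{P}^\natural = \mathrm{span}\{x^jy^k : j+k \in \mathcal{K}^\natural\}$ ,
       $\mathcal{P}^\flat = \mathrm{span}\{x^jy^k : j+k \in \mathcal{K}^\flat\}$ .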

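These subspaces are clearly disjoint, and moreover one has
$\mathcal{P} = \mathcal{P}^\sharp \oplus \mathcal{P}^\natural \oplus \mathcal{P}^\flat$. Finally,
let $\mathcal{N}^\sharp = \mathcal{N}\cap\mathcal{P}^\sharp$ and $\mathcal{R}^\sharp = \mathcal{R}\cap\mathcal{P}^\sharp$ and define the subspaces $\mathcal{Z}$ and $\mathcal{W}$ of $\mathcal{P}$ as

(16)   $\mathcal{Z} = \mathcal{N}^\sharp \oplus \mathcal{P}^\flat\,, \quad \mathcal{W} = \mathcal{R}^\sharp \oplus \mathcal{P}^\natural$ .

It is an easy matter to check that $\mathcal{Z}\cap\mathcal{W} = \{0\}$ and $\mathcal{Z}\oplus\mathcal{W} = \mathcal{P}$. The construction of
bases for $\mathcal{Z}$ and $\mathcal{W}$ is quite straightforward: a monomial $x^jy^k$ belongs to $\mathcal{Z}$ in either case
($j+k \in \mathcal{K}^\sharp$ and $\langle j-k,\lambda\rangle = 0$) or ($j+k \in \mathcal{K}^\flat$); else it belongs to $\mathcal{W}$. The hypothesis (i)
on $\lambda$ (first Melnikov's condition) formulated at the beginning of the introduction means
that the non-resonance condition

(17)   $\langle k,\lambda\rangle \neq 0$ for $0 \neq k \in \mathcal{K}^\sharp \cup \mathcal{K}^\natural$

is satisfied. This implies $\mathcal{W} \subset \mathcal{R}$, so that for every $\psi \in \mathcal{W}$ the unique solution
$\chi = L_{H_0}^{-1}\psi$, $\chi \in \mathcal{W}$ of the equation $L_{H_0}\chi = \psi$ exists. With this setting, the equation

(18)   $L_{H_0}\chi + Z = \Psi$ ,

with $\Psi$ known, admits a straightforward solution. Split $\Psi = \Psi_{\mathcal{Z}} + \Psi_{\mathcal{W}}$ with
$\Psi_{\mathcal{Z}} \in \mathcal{Z}$ and $\Psi_{\mathcal{W}} \in \mathcal{W}$; such a decomposition exists and is unique, because $\mathcal{Z}$ and $\mathcal{W}$ are
complementary subspaces. Then set $Z = \Psi_{\mathcal{Z}}$, and determine $\chi = L_{H_0}^{-1}\Psi_{\mathcal{W}}$ according
to (13).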
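The membership rule for monomials stated above can be written as a small function. This is a sketch for the setting of this subsection only (theorem 1); the name `classify_monomial` and the sample eigenvalues are assumptions made for illustration.

```python
def classify_monomial(j, k, lam):
    """Return "Z" or "W" for the monomial x^j y^k in the setting of theorem 1.

    The transversal degree is the total degree in x_2..x_n, y_2..y_n:
      degree 0  -> sharp class:   "Z" if <j-k,lambda> = 0, else "W";
      degree 1  -> natural class: always "W" (the divisor must not vanish);
      degree >1 -> flat class:    always "Z".
    """
    transversal = sum(j[1:]) + sum(k[1:])
    divisor = sum((a - b) * l for a, b, l in zip(j, k, lam))
    if transversal > 1:
        return "Z"
    if transversal == 1:
        if divisor == 0:
            # Would contradict the first Melnikov condition.
            raise ValueError("resonant monomial in the natural class")
        return "W"
    return "Z" if divisor == 0 else "W"


lam = (1j, 2.5j)  # assumed sample eigenvalues, n = 2
labels = [
    classify_monomial((1, 0), (1, 0), lam),  # x_1 y_1
    classify_monomial((2, 0), (1, 0), lam),  # x_1^2 y_1
    classify_monomial((1, 1), (0, 0), lam),  # x_1 x_2
    classify_monomial((0, 1), (0, 1), lam),  # x_2 y_2
]
```

Terms labelled "W" are the ones removed by the generating function, while terms labelled "Z" are kept in the normal form.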

2.3 Adding the second Melnikov's condition 

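With minor changes one can also prove that the normal form of theorem 2 can be
constructed. Let

       $\mathcal{K}^\sharp = \{k \in \mathbb{Z}^n : k_2 = \dots = k_n = 0\}$ ,
(19)   $\mathcal{K}^\natural = \{k \in \mathbb{Z}^n : |k_2| + \dots + |k_n| = 1, 2\}$ ,
       $\mathcal{K}^\flat = \{k \in \mathbb{Z}^n : |k_2| + \dots + |k_n| > 2\}$ .

One has again $\mathbb{Z}^n = \mathcal{K}^\sharp \cup \mathcal{K}^\natural \cup \mathcal{K}^\flat$, of course. The subspaces of $\mathcal{P}$ are defined again as
in (15), although they turn out to be different in view of the differences in the sets $\mathcal{K}$.
Finally, let $\mathcal{N}^\sharp = \mathcal{N}\cap\mathcal{P}^\sharp$, $\mathcal{R}^\sharp = \mathcal{R}\cap\mathcal{P}^\sharp$, $\mathcal{N}^\natural = \mathcal{N}\cap\mathcal{P}^\natural$ and $\mathcal{R}^\natural = \mathcal{R}\cap\mathcal{P}^\natural$, and define the
subspaces $\mathcal{Z}$ and $\mathcal{W}$ of $\mathcal{P}$ as

(20)   $\mathcal{Z} = \mathcal{N}^\sharp \oplus \mathcal{N}^\natural \oplus \mathcal{P}^\flat\,, \quad \mathcal{W} = \mathcal{R}^\sharp \oplus \mathcal{R}^\natural$ .

The difference with respect to the previous case is just that now $\mathcal{N}^\natural$ is not empty,
because it contains all monomials of the form $(x_1y_1)^jx_\nu y_\nu$ with $\nu = 2,\dots,n$ and
positive $j$. This forces the change in the definition of the subspaces $\mathcal{Z}$ and $\mathcal{W}$. However,
the properties $\mathcal{Z}\cap\mathcal{W} = \{0\}$ and $\mathcal{Z}\oplus\mathcal{W} = \mathcal{P}$ remain true. Furthermore, in view of the
second Melnikov's condition, also the property $\mathcal{W} \subset \mathcal{R}$ holds true, so that for every
$\psi \in \mathcal{W}$ the unique solution $\chi = L_{H_0}^{-1}\psi$, $\chi \in \mathcal{W}$ of the equation $L_{H_0}\chi = \psi$ exists.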



3. Quantitative estimates 

Pick a real vector $R \in \mathbb{R}^n$ with positive components, and consider the domain

(21)   $\Delta_R = \{(x,y) \in \mathbb{C}^{2n} : |x_j| < R_j\,,\ |y_j| < R_j\,,\ 1 \le j \le n\}$ ,

namely a polydisk which is the product of disks of radii $R_1,\dots,R_n$ in the planes of the
complex coordinates $(x_1,\dots,x_n)$ and $(y_1,\dots,y_n)$, respectively. Let also

(22)   $\Lambda = \min_{1\le j\le n} R_j$ .

The norm $\|f\|_R$ of a homogeneous polynomial $f = \sum_{|j+k|=r} f_{j,k}x^jy^k$ in the polydisk
$\Delta_R$ is defined as

(23)   $\|f\|_R = \sum_{|j+k|=r} |f_{j,k}|\,R^{j+k}$ .

A family of polydisks $\Delta_{\delta R}$ of radii $\delta R$, with $0 < \delta \le 1$ will be considered below. With
a minor abuse the simplified notation $\|\cdot\|_\delta$ in place of $\|\cdot\|_{\delta R}$ will be used.
The main result of this section is

Lemma 1: Let the Hamiltonian satisfy $\|H_s\|_1 \le h^{s-1}E$ for $s \ge 1$, with some
constants $h > 0$ and $E > 0$. Let $0 < d < 1/2$. Then there exist positive constants $\beta$ and
$G$ depending on $E$, $h$, $\Lambda$, $d$ and on $\lambda_1,\dots,\lambda_n$ such that

$\|\chi_r\|_{1-d} \le \beta^{r-1}G$ for all $r \ge 1$ .

The rest of this section is devoted to the proof. Some technical calculations are deferred 
to appendix B. 

3.1 An arithmetic lemma 

The following lemma will play a crucial role in the proof of lemma 1 . 

Lemma 2: Let $\lambda \in \mathbb{C}^n$ be such that $\lambda_1$ satisfies the non-resonance condition (3). Then
there exists a positive $\gamma$ such that the inequality

$|\langle k,\lambda\rangle| \ge |k|\,\gamma$

holds true for all non-zero $k \in \mathcal{K}^\sharp \cup \mathcal{K}^\natural$ defined as in (14).

Corollary 1: Let in addition the non-resonance condition (4) be satisfied. Then the
same statement holds true for all non-zero $k \in \mathcal{K}^\sharp \cup \mathcal{K}^\natural$ defined as in (19).

The proof of the corollary is just a trivial modification of the 

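Proof of lemma 2. For $k \in \mathcal{K}^\sharp$ the claim is obvious, since $|\langle k,\lambda\rangle| = |k_1\lambda_1|$. So, let
$k \in \mathcal{K}^\natural$. Set $\vartheta = \max(|\lambda_2|,\dots,|\lambda_n|)$ (for the corollary maximize also over $|2\lambda_\nu|$ and
$|\lambda_\nu \pm \lambda_{\nu'}|$ with $\nu' \neq \nu$). Pick an integer $N > 1 + 2\vartheta/|\lambda_1|$ and set

$\delta = \min_{0<|k|<N} |\langle k,\lambda\rangle|\,, \quad \gamma = \min\Bigl(\frac{\delta}{N}\,,\ \frac{(N-1)\,|\lambda_1|}{2N}\Bigr)$ ;

in view of the non-resonance condition (17) one has $\delta > 0$. Then the claim of the lemma
holds true with the given value of $\gamma$. Indeed, let $k \in \mathcal{K}^\natural$, so that $|k_1| = |k| - 1$. If $|k| < N$
then $|\langle k,\lambda\rangle| \ge \delta \ge N\gamma > |k|\gamma$. If $|k| \ge N$ use $\vartheta < (N-1)|\lambda_1|/2$, which follows from the
choice of $N$, and evaluate

$|\langle k,\lambda\rangle| \ge |k_1|\,|\lambda_1| - \vartheta > (|k|-1)\,|\lambda_1| - \frac{(N-1)\,|\lambda_1|}{2} \ge \frac{(|k|-1)\,|\lambda_1|}{2} \ge \frac{N-1}{N}\cdot\frac{|k|\,|\lambda_1|}{2} \ge |k|\,\gamma$ .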



Q.E.D. 
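A quick numerical illustration of lemma 2 may help fix ideas. The sketch below scans the integer vectors of the classes defined in (14) up to a cutoff and returns the smallest value of $|\langle k,\lambda\rangle|/|k|$ found; it is a finite check only, not a proof, and the function name `gamma_estimate`, the cutoff and the sample eigenvalues are assumptions.

```python
def gamma_estimate(lam, kmax):
    """Smallest |<k,lambda>| / |k| over the sharp and natural classes with |k_1| <= kmax.

    Sharp class:   k = (k_1, 0, ..., 0) with k_1 != 0.
    Natural class: k = (k_1, 0, ..., +-1, ..., 0), one transversal entry equal to +-1.
    """
    n = len(lam)
    best = float("inf")
    for k1 in range(-kmax, kmax + 1):
        if k1 != 0:
            best = min(best, abs(k1 * lam[0]) / abs(k1))
        for nu in range(1, n):
            for sign in (1, -1):
                value = abs(k1 * lam[0] + sign * lam[nu])
                if value == 0:
                    raise ValueError("first Melnikov condition violated")
                best = min(best, value / (abs(k1) + 1))
    return best


# Assumed sample: lambda = (i, 2.5 i); the ratio lambda_2 / lambda_1 is not an integer.
gamma = gamma_estimate((1j, 2.5j), 50)
```

For this sample the minimum is attained at $k = (\pm3, \mp1)$, where $|\langle k,\lambda\rangle| = 0.5$ and $|k| = 4$; for large $|k_1|$ the ratio approaches $|\lambda_1|$, which is the mechanism exploited in the proof.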



3.2 Generalized Cauchy estimates 

Here I refer to the more restrictive hypotheses of theorem 2, and in particular to the 
spaces V defined as in sect. 2.3. However, the same arguments with very little simpli- 
fications apply also to the setting of sect. 2.2, which applies to theorem 1. I will insert 
short comments in parentheses concerning the latter case, where appropriate. 

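The estimates in this section strongly depend on a suitable splitting of all functions
over the subspaces $\mathcal{P}^\sharp$, $\mathcal{P}^\natural$ and $\mathcal{P}^\flat$. At a formal level, it is useful to keep in mind the
following table concerning the Poisson bracket:

  $\{\cdot,\cdot\}$            $\mathcal{P}^\sharp$        $\mathcal{P}^\natural$                                              $\mathcal{P}^\flat$
  $\mathcal{P}^\sharp$         $\mathcal{P}^\sharp$        $\mathcal{P}^\natural$                                              $\mathcal{P}^\flat$
  $\mathcal{P}^\natural$       $\mathcal{P}^\natural$      $\mathcal{P}^\sharp\oplus\mathcal{P}^\natural\oplus\mathcal{P}^\flat$       $\mathcal{P}^\natural\oplus\mathcal{P}^\flat$
  $\mathcal{P}^\flat$          $\mathcal{P}^\flat$         $\mathcal{P}^\natural\oplus\mathcal{P}^\flat$                               $\mathcal{P}^\flat$

(For the subspaces defined as in sect. 2.2 just remove $\mathcal{P}^\natural$ from the central case
corresponding to the Poisson bracket between functions in $\mathcal{P}^\natural$.) In view of the
transformation formulae (11) the situation to be considered is the following. A generating function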
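$\chi \in \mathcal{W}\cap\mathcal{P}_r$ with some $r \ge 1$ is given in the form $\chi = L_{H_0}^{-1}\psi$, with known $\psi \in \mathcal{W}\cap\mathcal{P}_r$.
Since $\mathcal{W} = \mathcal{R}^\sharp\oplus\mathcal{R}^\natural$ one has $\chi = \chi^\sharp + \chi^\natural$, with an obvious meaning of the notation. The
operator $L_\chi$ may be applied either to a generic function $f = f^\sharp + f^\natural + f^\flat \in \mathcal{P}_s$ with
$s \ge r$ or to a function in normal form $Z = Z^\sharp + Z^\natural + Z^\flat \in \mathcal{Z}\cap\mathcal{P}_m$ with $0 < m < r$ (in
theorem 1 $Z^\natural = 0$). In particular one has

(24)   $Z^\sharp = \sum_{j\ge1} c_j\,(x_1y_1)^j\,, \quad Z^\natural = \sum_{j\ge0,\ 2\le\nu\le n} c_{j,\nu}\,(x_1y_1)^jx_\nu y_\nu$

due to the non-resonance conditions on $\lambda$. For some non-negative $\delta'$, $\delta''$, $\delta$ satisfying
$0 \le \max(\delta',\delta'') < \delta < 1/2$ the norms $\|\psi\|_{1-\delta'}$, $\|f\|_{1-\delta''}$ and $\|Z\|_{1-\delta''}$ are assumed to
be known, and one looks for an estimate of the Lie derivative in a domain $\Delta_{(1-\delta)R}$. The
following estimates will be used in the rest of the paper.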

(i) The generating function $\chi$ is estimated by

(25)   $\|\chi\|_{1-\delta'} \le \frac{1}{\gamma}\,\|\psi\|_{1-\delta'}$ ,

with $\gamma$ as in lemma 2.

(ii) The general estimate for the Lie derivative of a generic function / is 

( 26 ) \\ L xf\\i- 6 ^ ( S -6')(S-S")A 2 llxlll - <5 '
with $\Lambda$ as in (22). Denoting by $(L_{\chi^\natural}f^\flat)^\natural$ the projection of $L_{\chi^\natural}f^\flat$ over $\mathcal{P}^\natural$ one has



(27) II^W^IL* < {6 _ S „ )A 2 Mis' 

(iii) For a function Z in normal form one has 



1-5' 



(28) 11^(^ + ^)11^ < {s _ 1_ Ml _ sl \\ Z 



1 



I call the reader's attention to the missing denominator $\delta - \delta'$ in (27) and (28). This
is crucial for the convergence proof: working it out requires
a quite accurate control of the accumulation of the divisors $\delta - \delta'$, $\delta - \delta''$ that appear
in the generalized Cauchy estimates for derivatives. The scheme in the next section is
specially devised in order to allow such a control.

The proof of (25) is a straightforward consequence of the definition of the norm
and of (13), since the denominators are uniformly estimated from below by $\gamma$, in view of
lemma 2.

The proof of the estimates (26), (27) and (28) is a purely technical matter, and is
deferred to appendix B.

3.3 Recursive estimates 

The aim of this section is to obtain estimates for the norms of the generating functions 
and of the transformed Hamiltonians, at every step of the normalization procedure. 

Consider a sequence of boxed domains $\Delta_{(1-\delta_r)R}$, where $\{\delta_r\}_{r\ge1}$ is a monotonically
increasing sequence of positive numbers converging to some $d < 1/2$. Let also $\delta_0 = 0$,
and $d_r = \delta_r - \delta_{r-1}$ for $r \ge 1$, so that $d_r < 1$ for all positive $r$. The purpose is to look
for estimates of the norms of the generating function $\chi_r$ and of the normal form $Z_r$ in
the polydisk $\Delta_{(1-\delta_{r-1})R}$, and of the functions $H_s^{(r)}$ in the domain $\Delta_{(1-\delta_r)R}$.

Let $\mathcal{J}_{r,s}$ for $1 \le r \le s$ be the set of integer arrays defined as

(29)   $\mathcal{J}_{r,s} = \Bigl\{ J = \{j_1,\dots,j_k\} : j_m \in \{1,\dots,r\}\,,\ 1 \le k \le 2(s-1)\,,\ \sum_{m=1}^{k}\log_2 j_m \le 2(s-1-\log_2 s) \Bigr\}$ .

Let also $\mathcal{J}_{0,s} = \emptyset$ for $s \ge 1$. Recalling that $\{d_r\}_{r\ge1}$ is a sequence of positive numbers
not exceeding 1 define the sequence $\{T_{r,s}\}_{0\le r\le s}$ as

(30)   $T_{0,s} = 1\,, \quad T_{r,s} = \max_{J\in\mathcal{J}_{r,s}}\prod_{j\in J} d_j^{-1}$ .

The following properties will be used below: for $0 \le r \le r' \le s$ one has

(31)   $T_{r,s} \le T_{r',s}$ ,

(32)   $\frac{1}{d_r^2}\,T_{r-1,r}\,T_{r',s} \le T_{r',r+s}$ .

Checking (31) is easy: for $r = 0$ use $d_l \le 1$ for $l \ge 1$; for $r > 0$ use the inclusion relation
$\mathcal{J}_{r,s} \subset \mathcal{J}_{r',s}$ for $r < r'$. In order to prove (32) remark that by definition one has

$\frac{1}{d_r^2}\,T_{r-1,r}\,T_{r',s} = \frac{1}{d_r^2}\max_{J\in\mathcal{J}_{r-1,r}}\prod_{j\in J} d_j^{-1}\ \max_{J'\in\mathcal{J}_{r',s}}\prod_{j'\in J'} d_{j'}^{-1} = \max_{J\in\mathcal{J}_{r-1,r}}\ \max_{J'\in\mathcal{J}_{r',s}}\ \prod_{j\in\{r,r\}\cup J\cup J'} d_j^{-1}$ .

It is enough to prove that $\{r,r\}\cup J\cup J' =: \tilde{J} \in \mathcal{J}_{r',r+s}$. First check that

$\#(\tilde{J}) = 2 + \#(J) + \#(J') \le 2 + 2(r-1) + 2(s-1) = 2(r+s-1)$ .

On the other hand, since $1 \le j \le r-1$ for all $j \in J$ and $1 \le j' \le r'$ for all $j' \in J'$, one
also has $1 \le j \le r'$ for all $j \in \tilde{J}$. Finally, evaluate

$\sum_{j\in\tilde{J}}\log_2 j = 2\log_2 r + \sum_{j\in J}\log_2 j + \sum_{j'\in J'}\log_2 j'$
   $\le 2\log_2 r + 2(r-1-\log_2 r) + 2(s-1-\log_2 s)$
   $= 2[r+s-1-(1+\log_2 s)]$
   $\le 2[r+s-1-\log_2(r+s)]$ ,

where the elementary inequality $1 + \log_2 s = \log_2 2 + \log_2 s = \log_2(2s) \ge \log_2(r+s)$ has
been used (recall that $r \le r' \le s$). Hence, $\tilde{J} \in \mathcal{J}_{r',r+s}$, as claimed.
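Properties (31) and (32) can be checked by brute force for small indices. The sketch below enumerates the arrays of (29) as multisets and takes the maximum in (30). It rests on three assumptions made only for this illustration: the sample sequence $d_j = 6d/(\pi^2j^2)$ with $d = 0.25$ (the choice made later in sect. 3.4), the convention that the empty array is admitted with product 1, and a small tolerance on the logarithmic constraint to absorb rounding.

```python
import math
from itertools import combinations_with_replacement


def d_seq(j, d=0.25):
    """Sample sequence d_j = 6 d / (pi^2 j^2); it sums to d and never exceeds 1."""
    return 6.0 * d / (math.pi ** 2 * j ** 2)


def T(r, s, d=0.25):
    """Brute-force value of T_{r,s} as in (30), with the arrays of (29) as multisets."""
    if r == 0:
        return 1.0
    log_bound = 2.0 * (s - 1 - math.log2(s)) + 1e-9  # tolerance for rounding
    best = 1.0  # empty array admitted (assumption of this sketch)
    for size in range(1, 2 * (s - 1) + 1):
        for J in combinations_with_replacement(range(1, r + 1), size):
            if sum(math.log2(j) for j in J) <= log_bound:
                best = max(best, math.prod(1.0 / d_seq(j, d) for j in J))
    return best


def check_properties(rmax=3, smax=4):
    """Return True if (31) and (32) hold for all 1 <= r <= r' <= s in the given range."""
    ok = True
    for r in range(1, rmax + 1):
        for rp in range(r, rmax + 1):
            for s in range(rp, smax + 1):
                ok &= T(r, s) <= T(rp, s) * (1 + 1e-9)              # (31)
                lhs = T(r - 1, r) * T(rp, s) / d_seq(r) ** 2
                ok &= lhs <= T(rp, r + s) * (1 + 1e-9)              # (32)
    return ok
```

In several cases, for instance $r = r' = s = 2$, the two sides of (32) coincide, which is why the comparison uses a relative tolerance.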
I shall also use the numerical sequence $\{\mu_{r,s}\}_{r\ge0,\,s\ge0}$ defined as

       $\mu_{0,0} = 0\,, \quad \mu_{0,s} = 1$ for $s > 0$ ,

(33)   $\mu_{r,s} = \sum_{0\le rp<s}\mu_{r-1,r}^{p}\,\mu_{r-1,s-rp}$ for $r > 0$ and $s > 0$ .

The recursive estimates are collected in

Lemma 3: Let the Hamiltonian satisfy $\|H_s^{(0)}\|_1 \le h^{s-1}E$ for some constants
$h > 0$ and $E > 0$. Let $d_0 = 1$ and $\{d_r\}_{r\ge1}$ be an arbitrary sequence of positive numbers
satisfying $\sum_{r\ge1} d_r = d$ with $d < 1$. Let also $\delta_0 = 0$ and $\delta_r = d_1 + \dots + d_r$. Then for
$s > r \ge 1$ the following estimates hold true:

(34)   $\|\chi_r\|_{1-\delta_{r-1}} \le \mu_{r-1,r}\,T_{r-1,r}\,C^{r-1}\,\frac{E}{\gamma}$ ,

(35)   $\|Z_r\|_{1-\delta_{r-1}} \le \mu_{r-1,r}\,T_{r-1,r}\,C^{r-1}\,\frac{E}{d_{r-1}}$ ,

(36)   $\|Z_r^\sharp + Z_r^\natural\|_{1-\delta_{r-1}} \le \mu_{r-1,r}\,T_{r-1,r}\,C^{r-1}E$ ,

(37)   $\|H_s^{(r)}\|_{1-\delta_r} \le \mu_{r,s}\,T_{r,s}\,C^{s-1}\,\frac{E}{d_r}$ ,

(38)   $\|H_s^{(r),\sharp} + H_s^{(r),\natural}\|_{1-\delta_r} \le \mu_{r,s}\,T_{r,s}\,C^{s-1}E$ ,

where

(39)   $C = h + \frac{4e^2E}{\gamma\Lambda^2}$ ,

and $T_{r,s}$ and $\mu_{r,s}$ are the sequences defined by (30) and (33).

Remark that (34), (36) and (38) differ from (35) and (37), respectively, only because a
divisor $d_r$ is missing.

Proof. By induction. For $r = 0$ only (37) and (38) are meaningful, and hold true in
view of $d_0 = \mu_{0,s} = T_{0,s} = 1$ and of $h < C$. The induction consists in first proving
that if (37) and (38) hold true up to $r-1$ then (34), (35) and (36) are true for $r$; next
proving that if (34), (35) and (36) hold true up to $r$ then (37) and (38) are true for $r$.

Let $r > 0$ and put $r-1$ in place of $r$ and $r$ in place of $s$ in (37) and (38). Recalling
that only $H_r^{(r-1),\sharp} + H_r^{(r-1),\natural}$ is used in order to determine $\chi_r$ use the definition of the
norm, the form of the solution of eq. (10) discussed in sect. 2, and the estimate (25).
This immediately shows that (34), (35) and (36) are true for $r$ provided (37) and (38)
hold true for $r-1$. Coming to (37) and (38) and recalling the recursive definitions (11)
there are only two kinds of terms to be estimated, namely $\frac{1}{s!}L_{\chi_r}^{s}Z_m$ for $1 \le m < r$ and
$\frac{1}{p!}L_{\chi_r}^{p}H^{(r-1)}_{(s-p)r+m}$ for $0 \le p < s$ and $0 \le m < r$. Indeed, remarking that $Z_r$ and $H_r^{(r-1)}$
are estimated by exactly the same quantity it is safe to estimate $\|H_{sr}^{(r)}\|$ by replacing
$Z_r$ with $H_r^{(r-1)}$ in the second of (11). This is tantamount to extending the sum in the
second of (11) to $p = s-1$ and making it identical with the sum in the first of (11),
with $m = 0$.

Denote $\varphi_s = L_{\chi_r}^{s}Z_m$, where $r > 1$, and split $\varphi_s = \varphi_s^\sharp + \varphi_s^\natural + \varphi_s^\flat$. I claim

2( S -1) 



(40) Up s l_,<sl — ||x, 



,s-l 



(41) llrf + rillx^^^lx) 2 ^"^"^^. 
for s > 1, where 

(42) D = /i r _i jr /i m _i )m T' r)r _|_ m C E 

The proof proceeds by induction. Let s = 1. By the general estimate (26) one has 

Using (34) and (35) one gets 

II II < J_ 1 rp rp E 2 

^ISr S d /m-l,m^r-l,r m-l.m^ r-l,r ^ ^ 2 , 

so that (40) immediately follows from (31), (32) and (39) 

Still keeping $s = 1$, (41) is obtained by remarking that the contributions to $\varphi_1^\sharp + \varphi_1^\natural$
come only from $L_{\chi_r}(Z_m^\sharp + Z_m^\natural)$ and $(L_{\chi_r^\natural}Z_m^\flat)^\natural$. Proceeding as above, from (28) and (27)
one gets

ll^ + ^ll 1 - < ^^ll^ r " 1,J + ^- 1) '1l 1 - fc -Jl z - 



m 



1-5. 



\Xr A\^ r ' 



d m A2» Ar »l-*r-lll" m lll-* ra -l • 

Then (41) for s = 1 follows from (35), (38) for r — 1 and (39). Remark that the divisor 
d r does not appear here. 

Let now $s > 1$, and assume that (40) is true up to $s-1$. Recalling that the divisor
$d_r$ due to the generalized Cauchy estimates is arbitrary, replace $d_r$ with $\frac{s-1}{s}d_r$ in the
estimates (40) and (41) for $\varphi_{s-1}$, thus getting

(S \ ^ / \ 2(s— 2) j-y 

t^t ) UaJ hAr -^d- r ' 

' 2s " 4 / 2e \ 2{s - 2 \, 1|S _ 2 



Consider first the estimate (41). Remarking that the contributions to <p\ + <p\ come only 
from L Xr ( < ipl_ 1 + and [L h<£s_i)\ use (26) and (27) to estimate 



< 



s 



^2^2 ll^lll-^-! Il^s-l + Ps-l\\i-S r+ d r / 

2s I, || || || 



Replacing (43) in the latter expression one gets 



II* 5 + rf Hi-'. - d*A* ) [d^K ) \M\l-l_D , 

so that (41) follows from the trivial inequality $\bigl(\frac{s}{s-1}\bigr)^{s-1} < e$. The estimate (40) is
checked with a similar calculation, just taking into account that (26) must be used
in order to estimate $L_{\chi_r}\varphi_{s-1}$. This produces an extra divisor $d_r$ with respect to the
calculation above.

Finally, replace (34) and (42) in (40) and (41). Using also (39), one gets 

s-l 



s\ ( s V / 2e Y {S - Z > „ l|S -i 



/ 1 \ E 



Using $s-1$ times the inequalities (31) and (32) one easily gets

$\Bigl(\frac{1}{d_r^2}T_{r-1,r}\Bigr)^{s-1}T_{r,r+m} \le \Bigl(\frac{1}{d_r^2}T_{r-1,r}\Bigr)^{s-2}T_{r,2r+m} \le \dots \le T_{r,sr+m}$ .

Thus one concludes

(44)   $\frac{1}{s!}\,\|L_{\chi_r}^{s}Z_m\|_{1-\delta_r} \le \mu_{r-1,r}^{s}\,\mu_{m-1,m}\,T_{r,sr+m}\,C^{sr+m-1}\,\frac{E}{d_r}$ ,

(45)   $\frac{1}{s!}\,\|(L_{\chi_r}^{s}Z_m)^\sharp + (L_{\chi_r}^{s}Z_m)^\natural\|_{1-\delta_r} \le \mu_{r-1,r}^{s}\,\mu_{m-1,m}\,T_{r,sr+m}\,C^{sr+m-1}E$ .

The estimate for $\frac{1}{p!}L_{\chi_r}^{p}H^{(r-1)}_{(s-p)r+m}$ is a minor variation of the scheme above. Only
the first step must be omitted. E.g., set $\varphi_p = L_{\chi_r}^{p}H^{(r-1)}_{(s-p)r+m}$ and proceed as follows.
Using (37) for $r-1$ get

Win II < ii T rysr+m — 1 rp . 

\\rO\\i_g r _ 1 — h L r—l,sr-\-m- L r,sr-\-m K - y J -' i 

this starts the induction on p. Then proceed for p > as above. The conclusion is 

( 46 ) p~\ \\ L Xr H (s-p)r+m\\l-S r - i xV r-l,ri J 'r-l,(s- V )r+ m Tr,sr+mC SrJrm 1 — , 

— f J 'r — l,rl J 'r—l,(s—p)r+mTr,sr+mC -E . 

Collecting (44), (45), (46) and (47) and referring to the transformation formulae (11)
it is now an easy matter to verify that (37) and (38) hold true provided the sequence
$\mu_{r,s}$ for $0 \le r \le s$ is defined as

       $\mu_{0,s} = 1$ for $s > 0$ ,

(48)   $\mu_{r,sr+m} = \mu_{r-1,r}^{s}\,\mu_{m-1,m} + \sum_{p=0}^{s-1}\mu_{r-1,r}^{p}\,\mu_{r-1,(s-p)r+m}$   for $r \ge 2$, $s \ge 1$, $1 \le m < r$ ,

       $\mu_{r,sr} = \sum_{p=0}^{s-1}\mu_{r-1,r}^{p}\,\mu_{r-1,(s-p)r}$   for $r \ge 1$, $s \ge 2$ .

This looks quite different from (33). However, I claim that (33) is just a harmless
extension of (48). Indeed, just redefine the indexes by replacing $sr+m$ with $s$, also
accepting $s > 0$, which removes the implicit restriction $s > r$. This is harmless,
because for $s < r$ one gets from (33) $\mu_{r,s} = \mu_{r-1,s}$. Therefore, in the second line one can
replace $\mu_{m-1,m} = \mu_{r-1,m}$ and include it into the sum. This completes the proof of
lemma 3. Q.E.D.

3.4 Completion of the proof of lemma 1 

The statement of the lemma concerns only the sequence of generating functions, which
are estimated by (34). The completion of the proof rests on a suitable choice of the
sequence $\{d_r\}_{r\ge1}$, that was left arbitrary, and on a suitable estimate of the sequence
$\{\mu_{r,s}\}_{s\ge r\ge0}$. As a matter of fact, only the diagonal elements of the latter sequence need
to be estimated, because the estimate for the generating functions in lemma 3 involves
only $\mu_{r-1,r} = \mu_{r,r}$.

First, pick a value for $d$, with $0 < d < 1/2$, and set

$d_r = \frac{6}{\pi^2}\cdot\frac{d}{r^2}$ ,

so that $\sum_{r\ge1} d_r = d$ in view of $\sum_{r\ge1} 1/r^2 = \pi^2/6$. The immediate consequence is that

(49)   $T_{r,s} \le \Bigl(\frac{4\pi^4}{9d^2}\Bigr)^{s-1}$ .

Indeed, use the definition (30) and recall the definition (29) of $\mathcal{J}_{r,s}$; then let $J \in \mathcal{J}_{r,s}$ and
evaluate

$\prod_{j\in J}\frac{1}{d_j} \le \Bigl(\frac{\pi^2}{6d}\Bigr)^{2(s-1)}\prod_{j\in J} j^2$ ,

because $\#(J) \le 2(s-1)$. On the other hand one has

$\log_2\prod_{j\in J} j^2 = 2\sum_{j\in J}\log_2 j \le 4(s-1)$ .

This proves (49).
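The two ingredients of this step are easy to check numerically. The sketch below verifies that the partial sums of $d_r = 6d/(\pi^2r^2)$ approach $d$ from below, and compares the product over one admissible array with the bound obtained by combining the two inequalities above, namely $(4\pi^4/(9d^2))^{s-1}$; this explicit constant, the value $d = 0.25$ and the sample array are assumptions of the illustration.

```python
import math


def d_r(r, d):
    """The choice d_r = 6 d / (pi^2 r^2)."""
    return 6.0 * d / (math.pi ** 2 * r ** 2)


def partial_sum(n, d):
    """Sum of d_1 .. d_n; it increases to d as n grows."""
    return sum(d_r(r, d) for r in range(1, n + 1))


def product_inverse_d(J, d):
    """Product of 1/d_j over the array J."""
    return math.prod(1.0 / d_r(j, d) for j in J)


def bound_49(s, d):
    """Bound (4 pi^4 / (9 d^2))^(s-1), from #(J) <= 2(s-1) and sum log2 j <= 2(s-1)."""
    return (4.0 * math.pi ** 4 / (9.0 * d ** 2)) ** (s - 1)


d = 0.25
gap = d - partial_sum(200000, d)   # small and positive
J = [2, 2, 1]                      # admissible for s = 4: 3 entries, sum of log2 equals 2
within_bound = product_inverse_d(J, d) <= bound_49(4, d)
```

The bound is far from sharp for a single array, which is harmless here because only geometric growth in $s$ is needed.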

Coming to the sequence (33), the problem is to show that $\mu_{r-1,r} \le \eta^{r-1}$ for some
positive $\eta$, since only $\mu_{r-1,r}$ enters the estimate (34). By separating the term $p = 0$ in
the sum one gets

$\mu_{r,s} = \mu_{r-1,s} + \mu_{r-1,r}\sum_{0\le rq<s-r}\mu_{r-1,r}^{q}\,\mu_{r-1,s-r-rq} = \mu_{r-1,s} + \mu_{r-1,r}\,\mu_{r,s-r}$ .

Putting $r-1$ in place of $r$ and $r$ in place of $s$ in the latter formula one gets

$\mu_{r-1,r} = \mu_{r-2,r} + \mu_{r-2,r-1}\,\mu_{r-1,1}$
   $= \mu_{r-3,r} + \mu_{r-3,r-2}\,\mu_{r-2,2} + \mu_{r-2,r-1}\,\mu_{r-1,1}$
   $= \dots = \mu_{0,r} + \mu_{0,1}\,\mu_{1,r-1} + \dots + \mu_{r-2,r-1}\,\mu_{r-1,1}$
   $\le \mu_{0,1}\,\mu_{r-2,r-1} + \dots + \mu_{r-2,r-1}\,\mu_{0,1}$ .

The last inequality requires a justification. Just use

$\mu_{0,r} \le \mu_{1,r} \le \dots \le \mu_{r-1,r} = \mu_{r,r} = \dots$ ,

which is an immediate consequence of the definition. Then, for $r-j \ge j$ substitute
$\mu_{r-j,j} = \mu_{j-1,j}$, and for $r-j < j$ use $\mu_{r-j,j} < \mu_{j-1,j}$ and so also $\mu_{r-j,j} + 1 \le \mu_{j-1,j}$,
which gets rid of the extra term $\mu_{0,r} = 1$.
Thus, the sequence $\{\nu_r\}_{r\ge1}$ defined as

(50)   $\nu_1 = 1\,, \quad \nu_r = \sum_{j=1}^{r-1}\nu_j\,\nu_{r-j}$ for $r > 1$

is a majorant of $\{\mu_{r-1,r}\}_{r\ge1}$. This is known as the Catalan sequence, and one has

(51)   $\nu_r = \frac{2^{r-1}(2r-3)!!}{r!} \le 4^{r-1}$ ,

where the common notation $(2n+1)!! = 1\cdot3\cdot\ldots\cdot(2n+1)$ has been used.

Thus, we conclude that $\mu_{r-1,r} \le 4^{r-1}$. Inserting the latter inequality and (49)
in (34) the statement of lemma 1 follows.
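The growth of the Catalan sequence can be confirmed directly. The sketch below generates $\nu_r$ from the recurrence (50), compares it with the standard closed form $2^{r-1}(2r-3)!!/r!$ (with the convention $(-1)!! = 1$), and checks the geometric bound $4^{r-1}$; the function names are hypothetical.

```python
from math import factorial


def catalan_by_recurrence(rmax):
    """Return [nu_1, ..., nu_rmax] from nu_1 = 1, nu_r = sum_{j=1}^{r-1} nu_j nu_{r-j}."""
    nu = [0, 1]  # index 0 is a placeholder so that nu[r] is nu_r
    for r in range(2, rmax + 1):
        nu.append(sum(nu[j] * nu[r - j] for j in range(1, r)))
    return nu[1:]


def odd_double_factorial(m):
    """m!! for odd m >= -1, with (-1)!! = 1!! = 1."""
    result = 1
    while m > 1:
        result *= m
        m -= 2
    return result


def catalan_closed_form(r):
    """Closed form 2^(r-1) (2r-3)!! / r!; the division is exact."""
    return 2 ** (r - 1) * odd_double_factorial(2 * r - 3) // factorial(r)


values = catalan_by_recurrence(12)
agrees = all(values[r - 1] == catalan_closed_form(r) for r in range(1, 13))
bounded = all(values[r - 1] <= 4 ** (r - 1) for r in range(1, 13))
```

Integer arithmetic is used throughout, so the comparison is exact rather than approximate.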



4. Proof of theorem 1 

Having established the estimate of lemma 1 on the sequence of generating functions it 
is now a standard matter to complete the proof of theorem 1. Hence this section will be 
less detailed with respect to the previous ones. 

The situation to be dealt with is the following. An infinite sequence $\{\chi_r\}_{r\ge1}$ of
generating functions is given, with $\chi_r \in \mathcal{P}_{r+2}$ (a homogeneous polynomial of degree
$r+2$) satisfying $\|\chi_r\|_R \le \beta^{r-1}G$ for some real vector $R$ with positive components and
some positive $\beta$ and $G$. Define a corresponding sequence of canonical transformations
$(x^{(r-1)},y^{(r-1)}) = \exp(L_{\chi_r})(x^{(r)},y^{(r)})$. By composition one also constructs a sequence
$\{\mathcal{C}^{(r)}\}_{r\ge0}$ of canonical transformations $(x^{(0)},y^{(0)}) = \mathcal{C}^{(r)}(x^{(r)},y^{(r)})$ recursively defined
as

$\mathcal{C}^{(0)} = \mathrm{Id}\,, \quad \mathcal{C}^{(r)} = \exp(L_{\chi_r})\circ\mathcal{C}^{(r-1)}$ ,

Id denoting the identity operator. The problem is to prove the following statements.

(i) Every near-the-identity canonical transformation defined via the exponential
operator $\exp(L_{\chi_r})$ is expressed as a power series which is convergent in a polydisk $\Delta_{\varrho R}$
for some positive $\varrho$.

(ii) For any function $f(x^{(r-1)},y^{(r-1)})$ analytic in $\Delta_{\varrho R}$ the transformed function is
analytic in the same polydisk, and moreover

$f(x^{(r-1)},y^{(r-1)})\Big|_{(x^{(r-1)},y^{(r-1)}) = \exp(L_{\chi_r})(x^{(r)},y^{(r)})} = [\exp(L_{\chi_r})f](x^{(r)},y^{(r)})$ .

(iii) The sequence $\{\mathcal{C}^{(r)}\}_{r\ge0}$ of canonical transformations converges for $r \to \infty$ to a
canonical transformation $\mathcal{C}^{(\infty)}$ which is analytic in a polydisk $\Delta_{(1-d)\varrho R}$ for some
positive $d < 1/2$.

(iv) For any function $f$ analytic in $\Delta_{\varrho R}$ the sequence recursively defined as $f^{(0)} = f$,
$f^{(r)} = \exp(L_{\chi_r})f^{(r-1)}$ converges for $r \to \infty$ to a function $f^{(\infty)}$ that is analytic in
$\Delta_{(1-d)\varrho R}$, and moreover one has

$f^{(\infty)} = f\circ\mathcal{C}^{(\infty)}$ .

The statement (i) actually reduces to Cauchy's proof of the existence and uniqueness of 
the local solution of an analytic system of differential equations. Indeed, the transformation 
defined via the exponential operator is the time-one canonical flow induced by the 
Hamiltonian vector field generated by $\chi_r$. The statement (ii) actually claims that the 
substitution of variables in a function $f$ may be effectively replaced by the application of 
the exponential operator to $f$; this is indeed the basis of the algorithm for constructing 
the normal form used in sect. 2. A detailed proof of both these statements may be found, 
e.g., in [9]; however, the reader may be able to reconstruct the proof by following the 
hints in [3]. 
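
As an illustration of statements (i) and (ii), the following worked example uses one degree of freedom and the cubic generating function $\chi = x^2 y$. Both this choice of $\chi$ and the sign convention $L_\chi f = \{f,\chi\}$, with $\{f,g\} = \partial_x f\,\partial_y g - \partial_y f\,\partial_x g$, are assumptions made here for illustration only; with the opposite convention the time-one flow is replaced by its inverse.

```latex
% Assumed convention (not fixed by this section): L_\chi f = \{f,\chi\}.
% Assumed example: \chi = x^2 y, one degree of freedom.
\begin{align*}
L_\chi x &= x^2, & L_\chi^s x &= s!\, x^{s+1}, \\
L_\chi y &= -2xy, & L_\chi^2 y &= 2x^2 y, & L_\chi^3 y &= 0,
\end{align*}
so that
\begin{align*}
\exp(L_\chi)\, x &= \sum_{s\ge 0} x^{s+1} = \frac{x}{1-x}, \qquad |x| < 1, \\
\exp(L_\chi)\, y &= y - 2xy + x^2 y = y\,(1-x)^2 .
\end{align*}
% Check of (ii) on f = xy: L_\chi f = -x^2 y and L_\chi^2 f = 0, hence
\[
\exp(L_\chi)(xy) = xy - x^2 y
  = \frac{x}{1-x}\cdot y\,(1-x)^2 ,
\]
% which is f evaluated at the transformed point.
```

The series for $x$ converges only for $|x| < 1$, which is the kind of restriction to a polydisk that statement (i) refers to.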

The proof of (iii) rests on the following remarks. In the polydisk $\Delta_{\varrho R}$ one has 
$|\chi_r(x,y)| \le \varrho^{r+2} \|\chi_r\|_R$; this, in turn, implies that $|x^{(r)} - x^{(r-1)}| \sim \beta^{r-1} \varrho^{r+2}$ and 
$|y^{(r)} - y^{(r-1)}| \sim \beta^{r-1} \varrho^{r+2}$. The geometric bound on the latter quantities implies that 
$\sum_{r\ge 1} |x^{(r)} - x^{(r-1)}|$ and $\sum_{r\ge 1} |y^{(r)} - y^{(r-1)}|$ behave like geometric series, i.e., converge 
for $\varrho$ small enough. Thus, the claim follows from Weierstrass theorem. Finally, 
the statement (iv) follows from (ii) being true for all $r > 0$, which implies that both 
sequences $f^{(r)} = \mathcal{C}^{(r)} f$ and $f \circ \mathcal{C}^{(r)}$ converge to the same limit. This concludes the proof 
of theorem 1. 
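
The geometric-series step in the proof of (iii) can be made explicit. The sketch below takes the bounds on the increments at face value, with an unspecified constant $K$ introduced here as an assumption standing for the factors hidden in the symbol $\sim$.

```latex
% K is a hypothetical constant absorbing the factors hidden in "\sim".
\[
\sum_{r\ge 1} \bigl| x^{(r)} - x^{(r-1)} \bigr|
  \le K \sum_{r\ge 1} \beta^{r-1} \varrho^{r+2}
  = K \varrho^{3} \sum_{r\ge 1} (\beta\varrho)^{r-1}
  = \frac{K \varrho^{3}}{1 - \beta\varrho} ,
  \qquad \beta\varrho < 1 ,
\]
% and the same bound holds with y in place of x.
```

In this sketch, "small enough" means $\varrho < 1/\beta$.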



A. Justification of the normalization algorithm 



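Justifying the normalization algorithm of sect. 2 is just a matter of rearranging terms in 
the expansion of $\exp(L_{\chi_r}) H^{(r-1)}$. Considering first $H_0$ and $H_r^{(r-1)}$ together, one has 

$$ \exp(L_{\chi_r}) \bigl( H_0 + H_r^{(r-1)} \bigr) = H_0 + L_{\chi_r} H_0 + \sum_{s\ge 2} \frac{1}{s!} L_{\chi_r}^{s} H_0 + H_r^{(r-1)} + \sum_{s\ge 1} \frac{1}{s!} L_{\chi_r}^{s} H_r^{(r-1)} . $$

Here, $H_0$ is the first term in the transformed Hamiltonian $H^{(r)}$ in (9). In view of (10) one 
has $L_{\chi_r} H_0 + H_r^{(r-1)} = Z_r$, which kills the unwanted term $H_r^{(r-1)}$ and replaces it with 
the normalized term $Z_r$. The two sums may be collected and simplified by calculating 

$$ \sum_{s\ge 2} \frac{1}{s!} L_{\chi_r}^{s} H_0 + \sum_{s\ge 1} \frac{1}{s!} L_{\chi_r}^{s} H_r^{(r-1)} = \sum_{s\ge 2} \frac{1}{(s-1)!} L_{\chi_r}^{s-1} \Bigl( \frac{1}{s} Z_r + \frac{s-1}{s} H_r^{(r-1)} \Bigr) . $$

Here, both $L_{\chi_r}^{s-1} Z_r$ and $L_{\chi_r}^{s-1} H_r^{(r-1)}$ are homogeneous polynomials of degree $sr + 2$, that 
are added to $H_{sr}^{(r)}$ in the second of (11). 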

Proceed now by transforming the functions $Z_1, \ldots, Z_{r-1}$ that are already in normal 
form. Recall that no such term exists for $r = 1$. For $r > 1$ calculate 

$$ \exp(L_{\chi_r}) Z_m = Z_m + \sum_{s\ge 1} \frac{1}{s!} L_{\chi_r}^{s} Z_m , \qquad \text{for } 1 \le m < r . $$

The term $Z_m$ is copied into $H^{(r)}$ in (9). The term $L_{\chi_r}^{s} Z_m$ is a homogeneous polynomial 
of degree $sr + m + 2$ that is added to $H_{sr+m}^{(r)}$ in the first of (11). 

Finally, consider all terms $H_s^{(r-1)}$ with $s > r$, that may be written as $H_{lr+m}^{(r-1)}$ with 
$l \ge 1$ and $0 \le m < r$, the case $l = 1$, $m = 0$ being excluded. One gets 

$$ \exp(L_{\chi_r}) H_{lr+m}^{(r-1)} = \sum_{p\ge 0} \frac{1}{p!} L_{\chi_r}^{p} H_{lr+m}^{(r-1)} , $$

where $L_{\chi_r}^{p} H_{lr+m}^{(r-1)}$ is a homogeneous polynomial of degree $(p + l)r + m + 2$. Collecting all 
homogeneous terms with $m = 0$, $l \ge 2$ and $p + l = s \ge 2$ one gets 

$$ \sum_{p=0}^{s-2} \frac{1}{p!} L_{\chi_r}^{p} H_{(s-p)r}^{(r-1)} , $$

that is added to $H_{sr}^{(r)}$ in the second of (11). Similarly, collecting all homogeneous terms 
with $0 < m < r$, $l \ge 1$ and $p + l = s \ge 1$ one gets 

$$ \sum_{p=0}^{s-1} \frac{1}{p!} L_{\chi_r}^{p} H_{(s-p)r+m}^{(r-1)} , $$

that is added to $H_{sr+m}^{(r)}$ in the first of (11). The latter case does not occur for $r = 1$. This completes 
the justification of the formal algorithm. 
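
The degree counts used throughout this appendix follow from one observation, sketched here: the Poisson bracket of homogeneous polynomials of degrees $a$ and $b$ is homogeneous of degree $a + b - 2$, so each application of $L_{\chi_r}$ raises the degree by $r$. The symbol $\deg$ is introduced only for this check, which also assumes that $Z_m$ and $H_s^{(r-1)}$ have degrees $m+2$ and $s+2$ respectively.

```latex
% \deg is notation introduced for this check only.
\begin{align*}
\deg \chi_r = r+2 \quad &\Longrightarrow \quad
  \deg L_{\chi_r}^{p}\, g = \deg g + p\,r , \\
\deg L_{\chi_r}^{s-1} Z_r &= (r+2) + (s-1)\,r = sr + 2 , \\
\deg L_{\chi_r}^{s} Z_m &= (m+2) + s\,r = sr + m + 2 , \\
\deg L_{\chi_r}^{p} H^{(r-1)}_{lr+m} &= (lr+m+2) + p\,r = (p+l)\,r + m + 2 .
\end{align*}
```

A polynomial of degree $sr + m + 2$ is exactly what is collected in the term of index $sr + m$ of the transformed Hamiltonian.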



B. Technical calculations 

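The aim is to check the estimates (26), (27) and (28). Write, generically, $\chi = 
\sum_{j,k} c_{j,k} x^j y^k$ and $f = \sum_{j,k} f_{j,k} x^j y^k$. Then compute 

(52) $$ L_\chi f = \sum_{j,k,j',k'} \sum_{l=1}^{n} \frac{j'_l k_l - j_l k'_l}{x_l y_l} \, c_{j,k} f_{j',k'} \, x^{j+j'} y^{k+k'} . $$

Using the definition of norm evaluate 

$$ \|L_\chi f\|_{1-\delta} \le \sum_{j,k,j',k'} \sum_{l=1}^{n} \frac{|j'_l k_l - j_l k'_l|}{R_l^2} \, |c_{j,k}| \, |f_{j',k'}| \, (1-\delta)^{|j+k|+|j'+k'|-2} R^{j+k+j'+k'} $$

$$ \le \frac{1}{\Lambda^2} \sum_{j,k} \sum_{j',k'} \sum_{l=1}^{n} |j'_l k_l - j_l k'_l| \, |c_{j,k}| \bigl( (1-\delta') - (\delta-\delta') \bigr)^{|j+k|-1} R^{j+k} $$

$$ \times \, |f_{j',k'}| \bigl( (1-\delta'') - (\delta-\delta'') \bigr)^{|j'+k'|-1} R^{j'+k'} . $$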

If $f$ is a generic function, then in view of $|j_l| \le |j + k|$ and $|k_l| \le |j + k|$ one has 

(53) $$ \sum_{l=1}^{n} |j'_l k_l - j_l k'_l| \le |j + k| \sum_{l=1}^{n} |j'_l + k'_l| = |j + k| \cdot |j' + k'| . $$

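Replacing in the estimate above and using the elementary inequality 

(54) $$ m (A - x)^{m-1} < \frac{A^m}{x} \qquad \text{for } 0 < x < A \text{ and } m \ge 1 $$

one gets 

(55) $$ \begin{aligned} \|L_\chi f\|_{1-\delta} &\le \frac{1}{\Lambda^2} \sum_{j,k} |c_{j,k}| \, |j + k| \bigl( (1-\delta') - (\delta-\delta') \bigr)^{|j+k|-1} R^{j+k} \\ &\quad \times \sum_{j',k'} |f_{j',k'}| \, |j' + k'| \bigl( (1-\delta'') - (\delta-\delta'') \bigr)^{|j'+k'|-1} R^{j'+k'} \\ &\le \frac{1}{(\delta-\delta')(\delta-\delta'')\Lambda^2} \sum_{j,k} |c_{j,k}| (1-\delta')^{|j+k|} R^{j+k} \\ &\quad \times \sum_{j',k'} |f_{j',k'}| (1-\delta'')^{|j'+k'|} R^{j'+k'} , \end{aligned} $$

from which (26) immediately follows in view of the definition of the norm. 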
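
The elementary inequality (54) can be verified directly. The following short derivation is a sketch using only the factorization of a difference of powers, with $A$, $x$ and $m$ as in (54); the substitution in the last line is the one suggested by the factors appearing in the estimate above, and is stated here as an assumption.

```latex
\begin{align*}
A^{m} - (A-x)^{m}
  &= x \sum_{i=0}^{m-1} A^{i} (A-x)^{m-1-i}
  \;\ge\; m\, x\, (A-x)^{m-1} ,
  \qquad 0 < x < A ,\ m \ge 1 , \\
\text{hence}\quad
m\,(A-x)^{m-1} &\le \frac{A^{m} - (A-x)^{m}}{x} < \frac{A^{m}}{x} .
\end{align*}
% Assumed substitution: A = 1-\delta', x = \delta-\delta', m = |j+k|.
\[
|j+k| \, \bigl( (1-\delta') - (\delta-\delta') \bigr)^{|j+k|-1}
  < \frac{(1-\delta')^{|j+k|}}{\delta-\delta'} .
\]
```

The same substitution with $\delta''$ and $|j'+k'|$ produces the second divisor $\delta - \delta''$.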

In order to prove (27) recall that x^ contains only monomials Cj^y k with j + k E 

b 

K^. The projection (L^f^ is just part of the general expression (52). In particular 
the value / = 1 in the sum must be discarded because the resulting monomials belong 
to V b . Moreover, for k £ one has ^]\=2 \ji + ki\ < 2. Thus the estimate (53) may be 
replaced by 

n n 

(56) E w** - M < 2 E w + k 'i\ = ^' + k '\ ■ 

1=2 1=2 

Hence the inequality (54) must be used only for the term involving $|j' + k'|$, and there 
is no need to introduce the divisor $\delta - \delta'$. Use instead $1/(1 - \delta) < 2$ in view of $\delta < 1/2$. 

Coming finally to (28), replace / in the general expression (52) by Z* + = 
X^eACttuK^ z v,v% v y v ■ Recall also that the coefficients Cj^ of x have the form Cj^ = 
(k-f\) ' m v i ew °f (13)- Then (53) may be replaced by 

(57) E Ml ~ h)\ < M E \3i ~ k i\ < M \j - k \ 

i i 

On the other hand, by lemma 2 one has 

\4>j,k\ 



+k' 



\Cj,k\ 



< 



\j - fe|7 ' 

so that the factor $|j - k|$ in (57) is compensated by the divisor here. This removes the need 
to introduce the divisor $\delta - \delta'$ in the rest of the estimates. Use instead $(1 - \delta)^{|j+k|-1} < 
2 (1 - \delta)^{|j+k|}$, which holds true in view of $\delta < 1/2$. Then (55) is replaced by 

\\l x (z* + z^-s < ^ E C 1 " V lj+kl R j+k 

x j2 i^iiH((i-n-^-n) 2H_1 ^ 
< ^_^ A2 Efeia-^) lj+fc| ^ +fc E i^i(i -n 2H ^ • 
Thus (28) follows in view of the definition of the norm. 